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TTLE 

METHOD FOR DETERMINING DNA SEQUENCES 

Cross Reference to Related Patents and Applications 
5 This application is related to inventions described 

in U.S. Patent No. 4 , 833, 332 entitled "Scanning 
Fluorescent Detection Systems", issued to Robertson et 
al., to an application S.N. 07/057566, filed June 12, 
1987 by Prober et al. entitled "Method, System, and 

10 Reagents for DNA Sequencing" (IP-597-A), and to an 

application Serial No. 07/545, 74 fr filed 29 June 1990 by 
Dam et al. entitled "Method for Determining DNA 
Sequences" (IP-0846) . 
Field of the Invention 

1 5 This invention relates to a method for obtaining 

more accurate DNA sequence information. 
Background of the Invention 

DNA sequencing is one of the cornerstone analytical 
techniques of modern molecular biology. The development 
20 of reliable methods for sequencing has lead to great 
advances in the understanding of the organization of 
genetic information and has made possible the 
manipulations of genetic material (i.e., genetic 
engineering) • 

25 There are currently two general methods for 

sequencing DNA: the Maxam-Gilbert chemical degradation 
method [A. M. Maxam et al., Meth. in F.ny.ym. r Vol. 65, 
499-559 (1980)] and the Sanger dideoxy chain termination 
method [F. Sanger, et al., Pror;. Nat, Anad. fini . HSA, 

30 Vol. 74, 5463-5467 (1977)]. A common feature of these 
two techniques is the generation of a set of DNA 
fragments which are analyzed by electrophoresis. The 
techniques differ in the methods used to prepare these 
fragments . 
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With the Maxam-Gilbert technique, DNA fragments are 
prepared through base-specific, chemical cleavage of the 
piece of DNA to be sequenced. The piece of DNA to be 
sequenced is first 5' -end-labeled with 32 P and then 
5 divided into four portions. Each portion is subjected 
to a different set of chemical treatments designed to 
cleave DNA at positions adjacent to a given base (or 
bases) . The result is that all labeled fragments will 
have the same 5' -terminus as the original piece of DNA 

10 and will have 3 '-termini defined by the positions of 

cleavage. This treatment is done under conditions which 
generate DNA fragments which are of convenient lengths 
for separation by gel electrophoresis. 

With Sanger's technique, DNA fragments are produced 

15 through partial enzymatic copying (i.e., synthesis) of 
the piece of DNA to be sequenced. In the most common 
version, the piece of DNA to be sequenced is inserted, 
using standard techniques, into a "sequencing vector", a 
large, circular, single-stranded piece of DNA such as 

20 the bacteriophage M13. This becomes the template for 
the copying process. A short piece of DNA with its 
sequence complementary to a region of the template just 
upstream from the insert is annealed to the template to 
serve as a primer for the synthesis. In the presence of 

2 5 the four natural deoxyribonucleoside triphosphates 

(dNTP's), a DNA polymerase will extend the primer from 
the 3' -end to produce a complementary copy of the 
template in the region of the insert. To produce a 
complete set of sequencing fragments, four reactions are 

3 0 run in parallel, each containing the four dNTP's along 

with a single dideoxyribonucleoside triphosphate (ddNTP) 
terminator, one for each base. ( 32 P-Labeled or 
f luorophore-labelled dNTP is added to afford labeled 
fragments.) If a dNTP is incorporated by the 
3 5 polymerase, chain extension can continue. If the- 
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corresponding ddNTP is selected, the chain is 
terminated. The ratio of ddNTP to dNTP's is adjusted to 
generate DNA fragments of appropriate lengths. Each of 
the four reaction mixtures will, thus, contain a 
5 distribution of fragments with the same 

dideoxynucleoside residue at the 3 1 -terminus and a 
primer-defined 5' -terminus. 

In both the Sanger and Maxam-Gilbert methods, base 
sequence information which generally cannot be directly 

1 0 determined by physical methods has been converted into 
chain-length information which can be determined. This 
determination can be accomplished through electro- 
phoretic separation. Under denaturing conditions (high 
temperature, urea present, etc.), short DNA fragments 

15 migrate as if they were stiff rods. If a gel matrix is 
employed for the electrophoresis, the DNA fragments will 
be sorted by size. The single-base resolution required 
for sequencing can usually be obtained for DNA fragments 
containing up to several hundred bases. 

20 To determine a full sequence, the four sets of 

fragments produced by either Maxam-Gilbert or Sanger 
methodology are subjected to electrophoresis. This 
results in the fragments being spatially resolved along 
the length of the gel. One method of discriminating tl\ j 

25 dyes (which replace the 32 p label) and using this 

information to determine DNA sequences is described in 
the Prober et al. application. This system is available 
in a commercial instrument known as the Genesis™ 2000 
available from E. I. du Pont de Nemours and Company, 

3 0 Wilmington, Delaware. The Genesis™ system for 

sequencing DNA, comprising a means for detecting the 
presence of radiant energy from closely-related yet 
distinguishable reporters or labels, which are 
covalently attached to compounds which function as chain 

3 5 terminating nucleotides in a modified Sanger DNA chain 



elongation method- Distinguishable fluorescent 
reporters are attached to each of the four 
dideoxynucleotide bases represented in Sanger DNA 
sequencing reactions, i.e., dideoxynucleotides of 
adenine, guanine, cytosine, and thymine. These 
reporter-labeled chain terminating reagents are 
substituted for unlabeled chain terminators in the 
traditional Sanger method and are combined in reactions 
with the corresponding deoxynucleotides, an appropriate 
primer, template, and polymerase. The resulting mixture 
contains DNA fragments of varying length that differ 
from each other by one base which terminate on the 3" 
end with uniquely labeled chain terminators 
corresponding to one of the four DNA bases. This new 
labelling method allows elimination of the customary 
radioactive label contained in one of the 
deoxynucleotides of the traditional Sanger method. 

Detection of these reporter labels can be 
accomplished with two stationary photomultiplier tubes 
(PMT's) which receive differing wavelength bands of 
fluorescent emissions from laser-stimulated reporters 
attached to chain terminators on DNA fragments. These 
fragments can be electrophoretically separated in space 
and/or time to move along an axis perpendicular to the 
sensing area of the PMT's. The fluorescent emissions 
first pass through a dichroic or other wavelength 
selective filter or filters, placed so as to direct one 
characteristic wavelength to one PMT, and the other 
characteristic wavelength to the other PMT. In this 
manner, different digital signals are created in each 
PMT that can be ratioed to produce a third signal that 
is unique to a given fluorescent reporter, even if a 
series of fluorescent reporters have closely spaced 
emission wavelengths. This system is capable of 
detecting reporters which are all efficiently excited by 
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a single laser line, such as 488 nm f and which have 
closely spaced emissions whose maxima usually are 
different from each other by only 5 to 7 nm. Therefore, 
the sequential base assignments in a DNA strand of 
5 interest can be made on the basis of the unique ratio 
derived for each of the four reporter-labeled chain 
terminators which correspond to each of the four bases 
in DNA* 



10 fluorescent labels in the Genesis™ 2000 unit, it is 

noted that the information could also be contained in a 
colorimetric label (S. Beck, Anal. Biochem. 164 (2) 514- 
520 (1987)), chemiluminescent (S. Beck, Nnnlfiin ftrcids 
Rgg- 17 5115-5123 (1989)) or other signal. 

15 The Genesis™ DNA sequencer is designed to take 

advantage of the dideoxy chain termination chemistry. 
In order to employ this chemistry, it was necessary to 
use four chemically-similar dyes to distinguish the four 
bases A, C, G, and T. Unless the dyjss are carefully 

20 chosen and exhaustively evaluated, their electrophoretic 
mobility may differ in some DNA sequences, leading to a 
scrambling of sequence information. The four dyes, 
chosen for similar electrophoretic mobility, had 
overlapping emission and excitation spectra. The need 

25 to distinguish these dyes without the excessive light 
loss of extremely narrow-band filters led to a two- 
channel detection scheme, in which the ratio of two 
signals is used to determine which base has pass* 5 ! the 
detector. When peaks are well-resolved and noise-free, 

3 0 the ratiometric signals are easy to interpret (Fig. 1). 
However, to maximize the amount of sequence information 
that can be obtained from each run, it is necessary to 
accurately interpret the two-channel signal under 
conditions of poor peak resolution and significant 

3 5 noise. 



While the base information is contained in 
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The methods for analysis of two-channel data under 
these conditions differ from those used to process 
conventional electrophoretbgrams and chromatograms The 
output of the analysis described here is a sequence of 
5 . base identifications. A, C, G, or T, while in 

chromatography,, the desired output is typically a list 
of peak positions and areas. Chromatographic processes 
generally do not involve two detector signals coupled by 
one of four ratios. This relationship between the two 

1 0 signals is a special property of the sequencer described 
in the Prober et al. patent application. Computational 
efficiency is a more important consideration for 
sequencing than for chromatography. In chromatography, 
useful results can be obtained by performing extensive 

15 computations on two or three peaks; in sequencing, it 
may be necessary to analyze 300 to 600 peaks. 

The ratiometric scheme of Prober et al also 
presents a signal interpretation problem different from 
that of other DMA sequencers. Sequencers employing 

20 primer chemistry are described in [L. M. Smith et al. 

wnm 0 <o AfHric, ia 2399-2412 (1985) and W. Ansorge et 

al. -t THo^m. THonhvs. Meth. 13. 315-323 (1986) J. 
These sequencers employ four signal channels, one for 
each base. Other sequencers, such as that described by 

25 Kambara et al. [H. Kambara et al., Blot finhnplogv £. 816- 
821 (1988)], employ one signal in each of four 
electrophoresis lanes. These systems employ yet another 
class of data analysis methods, since the results from 
four separate lanes must be registered, or aligned, in 

3 0 the proper time sequence. 

In these automated versions of DNA sequencing the 
reporter may be fluorimetric as is described in the 
Prober et al. application, colorimetric (S. Beck, Ana l . 
RinrhPm. 164 (2) 514-520 (1987), chemilumine scent (S. 
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Beck, NnriMr* Reids Res. 17 5115-5123 (1989), or of some 
other type. 

Sequencers employing primer chemistry such as 
(Hunkapiller et al., U.S. 4,811,218) are not so 
5 restricted in the selection of dyes that may be used to 
tag the DNA fragments . These sequencers can employ four 
signal channels, one for each base, and thus do not 
require the complex algorithms needed to interpret 
ratiometric signa s. On the other hand, these 

1 0 sequencers cannot enjoy the advantages of terminator 
chemistry. In particular, primer chemistry r* quires 
four separate reaction tubes for each sample to be 
sequenced, while terminator chemistry requires only one. 
In addition, primer chemistry is susceptible to errors 

15 from "false stops", erroneous signals produced when a 

polymerase is unable to proceed past a certain point on 
a DNA strand. 

Other sequencers, such as that described by Kambara 
et al. (H Kambara et al., Biotechnology £ 816-821 

20 (1988), employ one signal in each of four 

electrophoresis lanes. This overc es many of the 
difficulties encountered with the resolution problem of 
the prior art automated DNA sequencers. These systems 
employ yet another class of data analysis methods, since 

25 the results from four separate lanes must be registered, 
or aligned, in the - roper time sequence. Once the lanes 
are registered, th*. ata analysis methods for these 
sequences can be identical to those of Hunkapiller et 
al. Proper registration for the lanes is obviously 

3 0 crucial to correct sequence determination. If lanes are 
improperly registered, the corresponding bases are 
interpreted out of order. The problem of registration 
of four lanes is complex because it is combinatorial in 
nature. For a given pair of closely spaced bands in all 

3 5 four lanes, there are 41 « 2x3x4 »- 24 possible orderings 
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of the bands. Only one corresponds to the correct 
sequence. The registration process can introduce errors 
in sequence interpretation, and therefore sequencers of 
the type described by Kambara may produce a smaller 
5 . amount of accurate sequence information than those of 
Hunkapiller, given equal resolution and signal-to-noise 
ratio. Note also that these sequences require not only 
four reaction tubes, but also four electrophoresis lanes 
for each DNA sample to be analyzed. 
10 " f Invention 

These twin problems of accuracy or resolution and 
registration are largely overcome by the method of the 
present invention. According to the method of this 
invention the sequence of bases may be determined, i.e., 
15 "called", by using two electrophoresis lanes for each 
sample whose DNA is to be sequenced. Each lane 
incorporates two terminator dyes. According to the 
invention a method for sequencing DNA to ascertain the 
sequence of the four bases G,C,A and .T comprises the 
20 steps of: producing a first set of DNA fragments having 
reporter labeled terminators for two of the four bases, 
each base having a different reporter, producing a 
second set of DNA fragments having reporter labeled 
terminators for the remaining two of the four bases, 
25 each base having a different reporter, separating the 

respective first and second sets by gel electrophoresis, 
generating a first signal for each of the respective 
sets that varies in amplitude to form peaks in 
accordance with a characteristic of the reporter and the 
3 0 rate at which a base moves through the gel, generating a 
second signal for each of the respective sets that 
varies in amplitude to form peaks in accordance with a 
characteristic of the reporter and the rate at which 
each has moved through the gel, and converting the fir-st 
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and second signals to signals corresponding to the four 
bases in the order of their movement through the gel. 

In a preferred embodiment of the invention the 
first set of DNA fragments includes G and C. This tends 
5 to overcome the well-known G-C compression phenomenon in 
which the spacing between G signal peaks and immediately 
subsequent signal peaks is highly variable. Further, in 
accordance with the preferred embodiment of this 
invention, the first and second sets are separated in 

10 adjacent lanes on the gel. This is advantageous since 
the DNA fragments to be separated are subjected to 
essentially the same gel composition, thermal 
environment, and electric field. During sequences, the 
four base signals are ordered according to their rate of 

15 passage through the gel. Also the reporters used and 
the first set may be the same as those used for the 
second set. 

This method has many advantages. Terminator 
chemistry may be used without the complexity of 

20 ratiometric signal processing as described by Prober et 
al. The method requires the use of only two reporters 
with similar electrophoretic mobility and different 
spectral characteristics, rather than the four reporters 
required by a single-lane system. Lane registration is 

25 involved, but it is not combinatorial in nature, since 
there are only two possible orderings of two objects. 
The two signals can simply be "slid" with respect to 
each other, until proper alignments determined, or 
described below. Because neither ratios nor complex 

3 0 registration algorithms are employed, this method is 
capable of producing a larger quantity of accurate 
sequence information from a given sample than the prior 
art methods* The use of two lanes rather than four is 
more economical than alternative four-lane methods for 

3 5 obtaining four separate signals. 
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In still another alternative method, the invention 
may be used with primer labelling. Either Sanger or 
Maxim-Gilbert chemistry may be employed. 

Bt ^o-f no<inripH nn of t-.hp Drawings 
5 The invention may be more easily understood by 

considering the following drawings in connection with 
the description in which: 

Fig. 1 is a flowchart depicting the steps according 
to the method of this invention for sequencing DNA 

10 fragments; 

Fig. 2 is a flowchart depicting the steps used in 
the method for registering the lanes as shown in Fig. 1; 

Fig. 3 are plots of the detected peaks plotted 
against time depicting the manner in which the peaks' 
1 5 spacing is set in order to properly space the several 
peaks ; and 

Fig. 4 is a flowchart depicting the details of the 
base calling step of the flowchart of Fig. 1. 

n»^n^ r.p^^pi-i n n r f » h » Pr-^rrml F.mhori i mftnt . 

20 The method of this invention will be described in 

connection with the Genesis™ 2000 Sequencer which has 
two detectors that provide the output signals of R and t 
as described above. These output signals vary in 
amplitude to form peaks in accordance with the 

25 characteristic of the reporter used with the DNA 

fragments and the time at which a base moves through the 
gel. Using the reagent kits available for use with the 
Genesis™ 2000 Sequencer, for example, DNA samples are 
prepared and two terminator reactions are carried out 

3 0 with fluorescent dideoxy terminators. In one reaction, 
only two terminators, G and C, are included. In the 
other reaction, only the A and T terminators are 
included. This pairing is preferred because of the G-C 
compression phenomenon; the spacing between G signal 

3 5 peaks and immediately subsequent C signal peaks is 
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highly variable. If the G and C terminators are 
included in an single reaction, it will not be necessary 
later for the algorithm which aligns the two lanes to 
take G— C spacing variability into account. The two 
5 reaction mixtures are separated on a sequencing gel. It 
is preferable to place the two mixtures in adjacent 
lanes on the gel, to minimize the misalignment between 
the two lanes. From each lane, two photomultipli 
signals are derived using the Genesis apparatus. 

1 0 The preferred combination of dyes and terminators 

is G-dye 1/ A-dye 1/ C-dye 2, and T-dye 2. Many 
potential choices for dye 1 and dye 2 may be available. 
Dye 1 and dye 2 should be chosen so that there is 
adequate spectral separation to distinguish between 

1 5 them, and so that the strongest and most evenl: 

distributed signals for all four bases are obtained. 
This latter parameter must be determined experimental. y, 
since it depends in a complicated way on the enzyme and 
reaction conditions being employed for sequencing. 

20 The four signals (two photomultiplier signals from 

each of two lanes) must be converted to signals 
corresponding to the four bases in the correct order. 
First, each pair or photomultiplier signals must be 
converted to an orthogonal basis so that one signal 

25 corresponds to one base, and the other signal 

corresponds to the other base. This can be done based 
on the fact that the two phot ©multipliers will respond 
differently to the two bases. Using the G-C lane as a- 
example : 

3 0 Pi = fiQi • G + fici • C 

P2 " *G2 • G + fic2 • C 
where Pi and P2 are the PMT signals, G and C are the 
actual concentrations of dye being sensed when the 
signals are obtained, and the fi's describe the response 

3 5 of the two PMT's to the two dyes. It is then possible 
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to derive signals representing the concentrations of the 
dyes being sensed: 

G - &C2 • Pi ~ B C1 * p 2> / (SC2BG1 ~ 1*01^62) t 1 ^ 
C = fi G 2 • Pi - *C1 • P2) / <BG2fiCl - BgiBc2> 12] 
5 From the other lane, A and T signals can be derived 
identically. 

Next it is necessary to register or align the G-C 
and A-T signals. There are many algorithms known for 
registration of signals, i.e., A. Rosenfeld, et al., 

10 n< T imi t> ^«-»t-«> Prnrox^ng. Academic Press, 297-302 
(1976) . One particular method is described in the 
example below. Once the signals are ordered, the data 
is identical in appearance to that produced by the 
system of U.S. Patent 4,811,218 issued to Hunkapiller et 

15 al. in which each signal corresponds to one base. Any 

means of sequence interpretation applicable to that data 
may be used. 

As shown in the flowchart of Fig. 1, registration 
of the lanes is accomplished by the procedure set forth 

20 in greater detail in the flowchart of Fig. 2. The four 
signals used as input to the two-lane data analysis 
process correspond to the photomultiplier signals in the 
G-C and A-T lanes. These four photomultiplier signals 
are converted into four base signals A,G,C and T using 

25 the equations 1 and 2 above. Next the peaks in the G-C 
lane are located. First, the G and C signals are 
summed. The resulting data array is scanned from 
beginning to end, and any point at which the first 
derivative of the data crossed zero with a slope 

3 0 exceeding a certain threshold (0.00015 V/data point) is 
taken to be a peak location. Thus a list of peaks in 
the G-C channis is obtained. A list of peaks in the A-T 
channel is obtained from the other lane data by an 
identical process. 




WO 92/02635 PCI7US91/05329 

13 

Next the alignment of the two arrays is determined. 
The alignment is carried .»ut on groups of 40 peaks. As 
shown in Figs. 2 and 3, when the two lists of peak 
positions are combined with improper alignment, there is 
5 a wide variance in the spacing between peaks in the 
combined list . When the A-T peaks are shifted by an 
offset value , that places them in their proper 
positions, the spacing variance is reduced. The A-T 
peak positions are shifted with respect to the G-C peak 

1 0 positions until a degree of offset is found that results 
in a minimum variance in peak-to-peak spacing when the 
G-C and A-T peak lists are combined. Using the offsets 
so determined, a new A-T data array is created and 
aligned to the G-C data array. Since the offset varied 

1 5 from one 40-peak block to the next, a linearly 

interpolated offset value is used at each data point. 

In order to facilitate peak calling, data is 
normalized, that is, the arrays are scaled to give 
identical average signal amplitudes .in all four 

2 0 channels . 

Base calling begins with the first peak in any of 
the four arrays, and proceeds down the arrays, one peak 
at a time. Once an expected peak position is determined 
as shown in the flowchart of Fig. 4, an oscillating 
25 search is performed about this position until a peak is 
found. A peak is defined by the zero-crossing criterion 
described above. If only one of the four channels, A, 
C, G, or T, has a peak at that position, the 
corresponding base is called. If two channels have a 

3 0 peak in the same place (due usually to noise in one of 

the channels), the base corresponding to the largest 
peak is called* 

Alternatively, this invention employing two lanes 
and two dyes can be used with primer labelling. In this 
3 5 case, either Sanger or Maxam-Gilbert chemistry may be 
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employed. When the invention is employed in this 
fashion, four reactions, as is the case for primer 
labelling, rather than two must be performed. However, 
these four reactions can be combined to require only two 
5 lanes on the gel, e.g., the G and C pots and the A and T 
pots may be combined. Thus the efficient use of gel 
space and the simplification in lane registration can 
still be exploited, even though primer chemistry is 
being used. 

1 0 The method of this invention produces more accurate 

and greater number of bases called than has heretofore 
been possible. 

Example 

The following example illustrates the principle of 

15 two-lane, two-dye sequencing. In actuality, four dye- 
terminators were used: ddG-SF505, ddA-SF512, ddC-SF519, 
and ddT-SF526. This permitted the use of commercially- 
available labelled terminator reagents from the Genesis 
System. A true two-dye system could utilize dye- 

20 terminator combinations whose synthesis is described in 
Prober et al. application, for example, ddG-SF505, ddA- 
SF505, ddC-SF519, and ddT-SF519 which are those that are 
available commercially in the Genesis™ 1000 kits. The 
samples in this example were run and analyzed in pairs, 

25 in a way that did not involve distinguishing the G dye 
from the A dye, or the C dye from the T dye. It should 
therefore be apparent that the substitution of SF505 for 
SF512, and of SF-519 for SF-526, does not materially 
affect the results. 

3 0 All reagents were commercial products of the 

Biotechnology Systems Division, E. I. du Pont de Nemours 
and Company, Wilmington, DE, except as noted. Three 
micrograms of Ml3mpl8 template DNA were used for each of 
three reactions, a G-C reaction, an A-T reaction, and a 

3 5 control. Each reaction mixture was incubated with 15 ng 
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of 17-mer universal M13 primer oligonucleotide (5 f - 
GTTTTCCCAGTCACGAC-3 9 ) in Sequenase™ buffer (U.S. 
Biochemical Corp., Cleveland, OH) a volume of 21 HI* at 
95°C for 2 minutes to denature the template, and then at 
5 37°C for 10 minutes to anneal the primer. The G-C 

reaction tube then received 2.5 dithiothreitol, 3.0 
|1L of 75 |1M each of dATP, dCTP, dGTP, and dTTP, and 0.5 
(IL of a mix of 1.12 |XM ddGTP-F fluorescent labeled G 
terminator, and 8.96 JIM ddCTP-F fluorescent labeled C 

10 terminator. The A-T reaction tube then received 2.5 flL 
dithiothreitol, 3.0 flL of 75 flM each of dATP, dCTP, 
dGTP, and dTTP, and 0.5 flL of a mix of 3.36 JIM ddATP-F 
fluorescent labeled A terminator, and 112 |1M ddTTP-F 
fluorescent labeled T terminator. The control reaction 

1 5 received all four terminators . Each reaction received 
0.5 JlL of Sequenase™ T7 polymerase (U.S. Biocfc Tiical 
Corp., Cleveland, OH), and was incubated at 42°C for 2 
min. An additional 1.0 JIL of Sequenase™ was added to 
each, and incubated for another 5 min at 42°C. 

20 Unincorporated ddNTP-F was removed from each reaction 
with a Sephadex G-50 Spin Column. 

The G-C, A-T, and control reaction mixtures were 
then loaded in adjacent lanes and electroj esed on a 
Genesis™ 2000 DNA Analysis System (£. I. a- Pont 

25 de Nemours and Company, Wilmington, DE) at 18W using a 
6% polyacrylamide gel (19:1 cros s linking ) . Data were 
collected for 10 hours and analyzed on a Macintosh II 
computer (Apple Corp., Cupertino, CA) using the program 
shown in Fig. 1, and flowcharted in Fig. 2. 

3 0 The data analyzed by this method yielded 501 bases 

of sequence • .formation at 97% accuracy, with 3 
ambiguous calls (no peak located within the expected 
peak interval) . The control reaction was analyzed using 
software supplied with the Genesis™ 2000 DNA Analysis 

35 System (Version 3.0.2), and produced 379 bases at 97% 
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accuracy, with 23 ambiguous calls. Thus a greater 
amount of information was obtained from the template 
utilizing the two-lane, two-dye method. 
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WHAT IS CLAIMED IS: 

1. A method for sequencing DNA to ascertain the 
sequence of the four bases G, C, A and T comprising the 
steps of; 

5 producing a first set of reporter labelled DNA 

fragments having for two of the four bases, each base 
having a different reporter, 

producing a second set of reporter labelled DNA 
fragments having for the remaining two of the four 
1 0 bases, each base having a different reporter, 

separating the respective first and second sets 
by gel electrophoresis, 

generating a first signal for each of the 
respective sets that varies in amplitude to form peaks 
1 5 in accordance with a characteristic of the reporter and 
the rate at which a base moves through the gel, 

generating a second signal for each of the 
respective sets that varies in amplitude to form peaks 
in accordance with a characteristic .of the reporter and 
20 the rate at which each has moved through the gel, and 
converting the first and second signals to 
signals corresponding to the four bases in the order of 
this movement through the gel. 

2. A method according to claim 1 in which the first 
25 set of DNA fragments is G and C. 

3 . A method according to claim 1 in which the first 
and second sets are separated in adjacent lanes on the 
gel. 

4. A method according to claim 3 which includes the 
3 0 step of ordering the four base signals according to 

their rate of passage through the gel. 

5. A method according to claim 1 which includes the 
step of ordering the four base signals according to 
their rate of passage through the gel. 
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6. A method according to claim 2 in which the first 
and second sets are separated in adjacent lanes on the 
gel. 

7. A method according to claim 6 which includes the 
5 step of ordering the four base signals according to 

their rate of passage through the gel. 

8. A method of claim 1 wherein the reporters used 
for the first set are the same as used for the second 
set. 

10 9. A method according to claim 8 in which the first 

set of DNA fragments is G and C. 

10. A method according to claim 3 in which the first 
and second sets are separated in adjacent lanes on the 
gel. 

15 11. A method according to claim 3 which includes the 

step of ordering the four base signals according to 

their rate of passage through the gel. 

12. A method according to claim. 1 wherein the DNA 

fragments have terminators which are labelled. 
20 13. A method according to claim 1 wherein the DNA 

fragments have primers which are labelled. 
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